首页> 外文OA文献 >Sample-level CNN Architectures for Music Auto-tagging Using Raw Waveforms

【2h】

Sample-level CNN Architectures for Music Auto-tagging Using Raw Waveforms

机译：使用Raw的音乐自动标记的样本级CNN架构波形

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
相似文献
相关主题

摘要

Recent work has shown that the end-to-end approach using convolutional neuralnetwork (CNN) is effective in various types of machine learning tasks. Foraudio signals, the approach takes raw waveforms as input using an 1-Dconvolution layer. In this paper, we improve the 1-D CNN architecture for musicauto-tagging by adopting building blocks from state-of-the-art imageclassification models, ResNets and SENets, and adding multi-level featureaggregation to it. We compare different combinations of the modules in buildingCNN architectures. The results show that they achieve significant improvementsover previous state-of-the-art models on the MagnaTagATune dataset andcomparable results on Million Song Dataset. Furthermore, we analyze andvisualize our model to show how the 1-D CNN operates.

机译：最近的工作表明，使用卷积神经网络（CNN）的端到端方法在各种类型的机器学习任务中都是有效的。对于音频信号，该方法使用一维卷积层将原始波形作为输入。在本文中，我们通过采用最新的图像分类模型，ResNets和SENets的构建基块，并在其中添加多级特征聚合，来改进用于音乐自动标记的一维CNN架构。我们比较了BuildingCNN架构中模块的不同组合。结果表明，与以前在MagnaTagATune数据集上的最新模型相比，它们具有显着的改进，在“百万首歌”数据集上具有可比的结果。此外，我们对模型进行分析和可视化以显示一维CNN的工作方式。

著录项

作者
Kim, Taejun; Lee, Jongpil; Nam, Juhan;
展开▼
作者单位

展开▼
年度 2017
总页数
原文格式 PDF
正文语种
中图分类

相似文献

外文文献
中文文献
专利

1. A sample-level DCNN for music auto-tagging [J] . Yu Yong-bin, Qi Min-hui, Tang Yi-fan, Multimedia Tools and Applications . 2021,第8期

机译：用于音乐自动标记的示例级别DCNN
2. Efficient Music Auto-Tagging with Convolutional Neural Networks [J] . Shaleen Bengani, S. Vadivel, J. Angel Arul Jothi Journal of computer sciences . 2019,第8期

机译：卷积神经网络的高效音乐自动标记
3. Music auto-tagging based on the unified latent semantic modeling [J] . Shao Xi, Cheng Zhiyong, Kankanhalli Mohan S. Multimedia Tools and Applications . 2019,第1期

机译：基于统一潜在语义建模的音乐自动标记
4. Sample-Level CNN Architectures for Music Auto-Tagging Using Raw Waveforms [C] . Taejun Kim, Jongpil Lee, Juhan Nam IEEE International Conference on Acoustics, Speech and Signal Processing . 2018

机译：使用原始波形的音乐自动标记的示例级CNN架构
5. Content-Based Music Recommendation with the LFM-1b Dataset and Sample-Level Deep Convolutional Neural Networks [D] . Platt, Devin. 2017

机译：具有LFM-1b数据集和样本级深度卷积神经网络的基于内容的音乐推荐
6. Towards an Efficient CNN Inference Architecture Enabling In-Sensor Processing [O] . Md Jubaer Hossain Pantho, Pankaj Bhowmik, Christophe Bobda 2021

机译：迈向有效的CNN推理架构实现了传感器处理
7. A hybrid CNN-LiGRU acoustic modeling using raw waveform sincnet for Hindi ASR [O] . ANKIT KUMAR, Rajesh Kumar Aggarwal 2020

机译：一种使用原始波形SINCNET进行HINDI ASR的混合CNN-LIGRU声学建模

Sample-level CNN Architectures for Music Auto-tagging Using Raw Waveforms

摘要

著录项

相似文献

相关主题

期刊订阅